[MXNET-1359] Adds a multiclass-MCC metric derived from Pearson #14461
Conversation
Hmm, actually... And of course this PR should have some tests added. I'll work through both these issues.
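For reference, a test along the following lines could exercise the new metric. This is only a sketch: it assumes the metric lands as mxnet.metric.PCC (the name proposed in this PR) alongside the existing mxnet.metric.MCC, using the usual EvalMetric.update(labels, preds) interface; the test function name is hypothetical.

import mxnet as mx
import numpy as np

def test_pcc_matches_mcc_on_binary():
    # On a two-class problem the multiclass coefficient should reduce to MCC,
    # which is the premise of this PR.
    labels = [mx.nd.array([0, 1, 1, 0, 1, 0])]
    preds = [mx.nd.array([[0.7, 0.3],
                          [0.2, 0.8],
                          [0.4, 0.6],
                          [0.9, 0.1],
                          [0.3, 0.7],
                          [0.6, 0.4]])]
    mcc = mx.metric.MCC()
    pcc = mx.metric.PCC()  # metric added by this PR
    mcc.update(labels, preds)
    pcc.update(labels, preds)
    np.testing.assert_almost_equal(pcc.get()[1], mcc.get()[1])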
Force-pushed from 39580ff to 6f0a2f4.
Thank you for your contribution @tlby! @mxnet-label-bot add[pr-work-in-progress]
Force-pushed from 3d2c5de to 64dde20.
diff --git a/python/mxnet/metric.py b/python/mxnet/metric.py
index 2a33cf4d9..6de76cc64 100644
--- a/python/mxnet/metric.py
+++ b/python/mxnet/metric.py
@@ -1576,9 +1576,8 @@ class PCC(EvalMetric):
n = max(pred.max(), label.max())
if n >= self.k:
self._grow(n + 1 - self.k)
- bcm = numpy.zeros((self.k, self.k))
- for i, j in zip(pred, label):
- bcm[i, j] += 1
+ ident = numpy.identity(self.k)
+ bcm = numpy.tensordot(ident[label], ident[pred].T, axes=(0,1))
self.lcm += bcm
self.gcm += bcm
This seems more efficient for constructing the confusion matrix, but benchmarks worse. I'm new to NumPy though; anyone see a better approach?
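One more option worth trying (just a sketch, not benchmarked here; build_confusion is a hypothetical helper name) is numpy.add.at, which does the scatter-add without a Python-level loop and without materializing one-hot matrices:

import numpy

def build_confusion(pred, label, k):
    # Accumulate a k x k confusion matrix indexed as [pred, label],
    # matching the existing loop's indexing.
    bcm = numpy.zeros((k, k))
    numpy.add.at(bcm, (pred, label), 1)
    return bcm

Whether this beats the loop or bincount will depend on batch size and k; numpy.add.at trades the Python loop for an unbuffered scatter, which is not always faster.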
I think this PR is ready now, sorry for posting the PR prematurely. I am happy with test coverage at this point, and happy with the metric scaling out to a large number of classes. The …
@mxnet-label-bot update[pr-awaiting-review]
diff --git a/python/mxnet/metric.py b/python/mxnet/metric.py
index 2a33cf4d9..7bc090a0a 100644
--- a/python/mxnet/metric.py
+++ b/python/mxnet/metric.py
@@ -1576,9 +1576,8 @@ class PCC(EvalMetric):
n = max(pred.max(), label.max())
if n >= self.k:
self._grow(n + 1 - self.k)
- bcm = numpy.zeros((self.k, self.k))
- for i, j in zip(pred, label):
- bcm[i, j] += 1
+ k = self.k
+ bcm = numpy.bincount(label * k + pred, minlength=k*k).reshape((k, k))
self.lcm += bcm
self.gcm += bcm
This is a bit faster when k is small, but scales to larger k poorly.
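For comparing the candidate constructions, a small timing harness along these lines might help (the class count k and batch size n below are arbitrary; results will shift as they change):

import timeit
import numpy

k, n = 100, 10000  # arbitrary class count and batch size for the comparison
rng = numpy.random.default_rng(0)
pred = rng.integers(0, k, size=n)
label = rng.integers(0, k, size=n)

def loop():
    bcm = numpy.zeros((k, k))
    for i, j in zip(pred, label):
        bcm[i, j] += 1
    return bcm

def bincount():
    return numpy.bincount(label * k + pred, minlength=k * k).reshape((k, k))

def tensordot():
    ident = numpy.identity(k)
    return numpy.tensordot(ident[label], ident[pred].T, axes=(0, 1))

for fn in (loop, bincount, tensordot):
    print(fn.__name__, timeit.timeit(fn, number=10))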
Force-pushed from b514fee to 7fb48d2.
It turned out building the confusion matrix wasn't the hotspot in …
@szha Can you help with the review of this PR?
@tlby thanks for the contribution. Great job.
…e#14461)
* Adds a multiclass-MCC metric derived from Pearson
* trigger ci
Description
A multiclass metric equivalent to mxnet.metric.MCC can be derived from mxnet.metric.PearsonCorrelation with the addition of an .argmax() on preds. I'd like to document this use case of Pearson and provide it behind a metric named "PCC" to simplify extending examples from F1 and MCC to multiclass predictions.
Checklist
Essentials
Please feel free to remove inapplicable items for your PR.
Changes
Comments
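As background for the Description above: the statistic commonly called multiclass MCC is Gorodkin's R_K, which can be computed directly from a confusion matrix. A minimal NumPy sketch of that textbook formula, for context rather than as this PR's exact code path (multiclass_mcc is a hypothetical name):

import numpy

def multiclass_mcc(cm):
    # cm is a k x k confusion matrix; rows index predictions, columns labels.
    c = numpy.trace(cm)   # correctly classified samples
    s = cm.sum()          # total samples
    p = cm.sum(axis=1)    # samples predicted as each class
    t = cm.sum(axis=0)    # true samples of each class
    cov_xy = c * s - p.dot(t)
    cov_xx = s * s - p.dot(p)
    cov_yy = s * s - t.dot(t)
    denom = numpy.sqrt(cov_xx * cov_yy)
    return cov_xy / denom if denom else 0.0

For two classes this reduces to the familiar binary MCC, which is why examples written against MCC extend to PCC with little more than a rename.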